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SYSTEM AND METHOD FOR EFFICIENTLY FINDING NEAR-SIMILAR 
IMAGES IN MASSIVE DATABASES 

ABSTRACT OF THE DISCLOSURE 

Massive amounts of multimedia data are stored in databases supporting web 
pages and servers, including text, graphics, video and audio. Searching and finding 
matching multimedia images can be time and computationally intensive. A method for 
storing and retrieving image data includes computing a descriptor, such an a Fourier- 
Mellin Transform (FMT), corresponding to a multidimensional space indicative of each 


O of the stored images and organizing each of the descriptors according to a set similarity 
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p 10 metric. The set similarity metric is based on Locality-Sensitive Hashing (LSH), and 
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P orders descriptors near to other descriptors in the database. The set similarity metnc 

employs set theory which allows distance between descriptors to be computed 
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s consistent with LSH. A target image for which a match is sought is then received, and a 

jjj descriptor indicative of the target image is computed. The database is referenced, or 

O 15 mapped, to determine close matches in the database. Mapping includes selecting a 
Q candidate match descriptor from among the descriptors in the database and employing a 

i* 

distance metric derived from the similarity metric to determine if the candidate match 
descriptor is a match to the target descriptor. 


